Action Recognition

Transformers

This project uses Transformer models to recognize human actions from video or textual descriptions. Users can experiment with self-attention mechanisms, multi-head attention, positional encoding, and fine-tuning pre-trained models to classify actions accurately.

View on GitHub